Abstract

The spread of a disease, a computer virus or information is discussed in a directed complex network. We 
are concerned with a steady state of the spread for the SIR and SIS dynamic models. In a scale-free directed 
network it is shown that the threshold of its outbreak in both models approaches zero under a high correlation 
between nodal indegrees and outdegrees. 



1. Introduction 

Recent studies on the spread of diseases, computer 
viruses or information in complex networks have been 
exclusively devoted to undirected ones such as social 
networks or the Internet. In those networks the direc- 
tion of links (or edges) is not important and can be 
ignored. On the other hand, there are important di- 
rected networks in nature and man-made systems such 
as food webs and the WWW, etc. However, studies on 
the spread in directed networks have not been done 
extensively. In order to tackle the problem, [5] made 
good use of the generating function methodology. Our 
approach is different from it and is based on the dy- 
namical mean-field theory of [1, 2, 6]. 

The objective is to study the SIR and SIS models 
in directed networks and to derive the critical infection 
rate or the threshold, above which a disease spreads in 
the networks and below which it dies out. In a directed 
network a disease passes to other nodes through outgo- 
ing links and a node is infected by incoming links. The 
indegree of a node is the number of incoming links into 
the node and the outdegree is that of outgoing links 
emanating from it. 

Many directed networks such as the WWW, networks of
metabolic reactions and phone calls have power law degree
distributions ([4], [8], [10]):

$$P(k) \propto k^{-\gamma} \quad\text{and}\quad Q(\ell) \propto \ell^{-\gamma'}$$

for all large indegrees $k$ and outdegrees $\ell$, respectively,
although real networks inevitably have finite sizes of
$k$, $\ell$. The distinction between both degree distributions
disappears for an undirected network. If the exponents
satisfy $2 < \gamma, \gamma' < 3$, these networks are called
scale-free. Several authors use $\gamma_{\mathrm{in}}, \gamma_{\mathrm{out}}$
in place of $\gamma, \gamma'$, respectively.

In [9] it was shown that the threshold of the SIS 
epidemic model in undirected scale-free networks is 
zero. In [1, 2, 6] a similar result was obtained also for 
the SIR epidemic model. 



In this paper, using the joint probability distri- 
bution of indegrees and outdegrees, we derive the 
thresholds for the SIR and SIS epidemic dynamics on 
directed networks. In fact, they turn out to be the same
for both. Furthermore, it is shown that the threshold
approaches zero under a high correlation between
indegrees and outdegrees. In the SIR model, the average
fraction of nodes that are ever infected until the disease 
dies out is also given using the indegree distribution. 



2. The SIR model in a directed network

First we investigate the SIR model on a directed 
network. Nodes of the network are divided into the 
following three groups as in [7, Chap. 10]: Susceptible 
(S), Infected (I) and Removed (R). Hereafter we will 
denote a susceptible node by an S-node etc., for short. 
An S-node becomes infected at a rate of $\lambda$ ($0 < \lambda < 1$).
The parameter $\lambda$ is the infection rate, for which we will
derive the critical value for an outbreak of a disease, a
computer virus, etc. The disease can be passed from
I-nodes to S-nodes following only the direction of
directed links. R-nodes have either recovered from the
disease or died and so they cannot pass the disease to
other nodes. An I-node becomes an R-node at a rate
$\delta$ ($0 < \delta \le 1$) and, without loss of generality, we will
set $\delta = 1$.

Let us denote the densities of S-, I-, R-nodes
with indegree $k$ and outdegree $\ell$ at time $t$ by
$S_{k,\ell}(t)$, $\rho_{k,\ell}(t)$, $R_{k,\ell}(t)$, respectively. So we have

$$S_{k,\ell}(t) + \rho_{k,\ell}(t) + R_{k,\ell}(t) = 1.$$

Let $p(k, \ell)$ be the joint probability distribution of
nodes with indegree $k$ and outdegree $\ell$, and let us
denote the marginal distributions by

$$P(k) = \sum_{\ell} p(k,\ell), \qquad Q(\ell) = \sum_{k} p(k,\ell),$$

and the averages including the $n$th moments by

$$\langle k^n \rangle = \sum_{k,\ell} k^n p(k,\ell) = \sum_{k} k^n P(k),$$

$$\langle \ell^n \rangle = \sum_{k,\ell} \ell^n p(k,\ell) = \sum_{\ell} \ell^n Q(\ell),$$

$$\langle k\ell \rangle = \sum_{k,\ell} k\ell\, p(k,\ell).$$
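As a small illustration of these definitions, the marginals and moments can be computed directly from a joint distribution. The sketch below assumes the joint distribution is stored as a Python dict mapping (indegree, outdegree) pairs to probabilities; that layout, the function names and the toy numbers are illustrative choices, not part of the model.

```python
from collections import defaultdict


def marginals(p):
    """Return the marginals P(k) and Q(l) of a joint pmf p[(k, l)]."""
    P, Q = defaultdict(float), defaultdict(float)
    for (k, l), prob in p.items():
        P[k] += prob  # sum over outdegrees l
        Q[l] += prob  # sum over indegrees k
    return dict(P), dict(Q)


def moment(p, a, b):
    """Mixed moment <k^a l^b> = sum over (k, l) of k^a * l^b * p(k, l)."""
    return sum((k ** a) * (l ** b) * prob for (k, l), prob in p.items())


# Hypothetical toy distribution, chosen so that <k> = <l>.
p_toy = {(1, 1): 0.5, (1, 2): 0.2, (2, 1): 0.2, (3, 3): 0.1}

if __name__ == "__main__":
    P, Q = marginals(p_toy)
    print("P(k):", P)
    print("Q(l):", Q)
    print("<k>, <l>, <kl>:",
          moment(p_toy, 1, 0), moment(p_toy, 0, 1), moment(p_toy, 1, 1))
```

Calling `moment(p, n, 0)` gives the nth indegree moment and `moment(p, 1, 1)` the mixed average that enters the threshold later on.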

Following the dynamical mean-field theory ([2, 6]),
we see that the spreading process on a directed
network can be described by the system of differential
equations:

$$\frac{dS_{k,\ell}(t)}{dt} = -\lambda k S_{k,\ell}(t)\theta(t), \tag{1}$$

$$\frac{d\rho_{k,\ell}(t)}{dt} = \lambda k S_{k,\ell}(t)\theta(t) - \rho_{k,\ell}(t), \tag{2}$$

$$\frac{dR_{k,\ell}(t)}{dt} = \rho_{k,\ell}(t). \tag{3}$$
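To see how equations (1)-(3) behave numerically, one can integrate them with a simple forward-Euler scheme. The following is only a sketch: the dict layout for the joint distribution, the small uniform initial infected density `rho0`, the step size and the time horizon are all assumptions made for illustration, and the coupling term is the link-weighted infected fraction defined in (4).

```python
def simulate_sir(p, lam, rho0=1e-3, dt=0.01, t_max=60.0):
    """Forward-Euler integration of (1)-(3) with recovery rate 1.

    p    : dict mapping (k, l) -> probability (assumed layout)
    lam  : infection rate lambda
    rho0 : assumed initial infected density in every degree class
    Returns the final dicts (S, rho, R), keyed by (k, l).
    """
    mean_l = sum(l * w for (_, l), w in p.items())
    S = {kl: 1.0 - rho0 for kl in p}
    rho = {kl: rho0 for kl in p}
    R = {kl: 0.0 for kl in p}
    for _ in range(int(t_max / dt)):
        # Probability that a random outgoing link starts at an I-node, eq. (4).
        theta = sum(l * w * rho[(k, l)] for (k, l), w in p.items()) / mean_l
        for (k, l) in p:
            new_inf = lam * k * S[(k, l)] * theta  # infection term of (1), (2)
            rec = rho[(k, l)]                      # recovery term of (2), (3)
            S[(k, l)] -= dt * new_inf
            rho[(k, l)] += dt * (new_inf - rec)
            R[(k, l)] += dt * rec
    return S, rho, R


if __name__ == "__main__":
    # Hypothetical toy joint distribution with <k> = <l>.
    p = {(1, 1): 0.5, (1, 2): 0.2, (2, 1): 0.2, (3, 3): 0.1}
    for lam in (0.3, 0.9):
        _, _, R = simulate_sir(p, lam)
        print(lam, sum(w * R[kl] for kl, w in p.items()))
```

Because each step moves density between the three compartments without creating or destroying it, the sum of the three densities stays equal to 1 in every degree class, up to rounding.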



The term $\lambda k S_{k,\ell}(t)\theta(t)$ in (1) and (2) indicates the
fraction of newly infected nodes through $k$ incoming links.
The probability, $\theta(t)$, that a randomly selected outgoing
link emanates from an I-node at time $t$ is given by

$$\theta(t) = \frac{\sum_{k,\ell} \ell\, p(k,\ell)\rho_{k,\ell}(t)}{\sum_{k,\ell} \ell\, p(k,\ell)}. \tag{4}$$



Note that each directed link is counted twice, once as an
outgoing link of some node and once as an incoming link of
another. Hence the average outdegree is equal to the average
indegree: $\langle \ell \rangle = \langle k \rangle$.
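The equality of the two averages is easy to check on any concrete edge list. The sketch below assumes nodes are labelled 0..n-1 and edges are (source, target) pairs; the helper names and the toy graph are hypothetical.

```python
from collections import Counter


def degree_sequences(edges, n):
    """Indegree and outdegree of nodes 0..n-1 from (source, target) pairs."""
    indeg, outdeg = [0] * n, [0] * n
    for src, dst in edges:
        outdeg[src] += 1  # the link leaves src
        indeg[dst] += 1   # the same link enters dst
    return indeg, outdeg


def joint_distribution(indeg, outdeg):
    """Empirical joint distribution p(k, l) as a dict keyed by (k, l)."""
    counts = Counter(zip(indeg, outdeg))
    n = len(indeg)
    return {kl: c / n for kl, c in counts.items()}


# Hypothetical toy graph on 4 nodes.
edges_toy = [(0, 1), (0, 2), (1, 2), (2, 0), (3, 0)]

if __name__ == "__main__":
    indeg, outdeg = degree_sequences(edges_toy, 4)
    print(sum(indeg) / 4, sum(outdeg) / 4)  # the two averages coincide
    print(joint_distribution(indeg, outdeg))
```

Every edge adds one to exactly one outdegree and one indegree, so both degree sums equal the number of edges.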

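Using the initial condition $S_{k,\ell}(0) = 1$, (1) is easily
solved as

$$S_{k,\ell}(t) = e^{-\lambda k \phi(t)}, \tag{5}$$

where

$$\phi(t) = \int_0^t \theta(t')\,dt'.$$

By (4) and $R_{k,\ell}(0) = 0$, $\phi(t)$ is expressed as

$$\phi(t) = \frac{1}{\langle \ell \rangle} \sum_{k,\ell} \ell\, p(k,\ell) \int_0^t \rho_{k,\ell}(t')\,dt'. \tag{6}$$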



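We derive the differential equation for $\phi(t)$. Using
(3), (5) and (6), and noting that (3) with $R_{k,\ell}(0) = 0$ gives
$R_{k,\ell}(t) = \int_0^t \rho_{k,\ell}(t')\,dt'$, it follows that

$$\begin{aligned}
\frac{d\phi(t)}{dt}
&= \frac{1}{\langle \ell \rangle} \sum_{k,\ell} \ell\, p(k,\ell)\rho_{k,\ell}(t) \\
&= \frac{1}{\langle \ell \rangle} \sum_{k,\ell} \ell\, p(k,\ell)\bigl(1 - R_{k,\ell}(t) - S_{k,\ell}(t)\bigr) \\
&= 1 - \phi(t) - \frac{1}{\langle \ell \rangle} \sum_{k,\ell} \ell\, p(k,\ell)e^{-\lambda k \phi(t)}.
\end{aligned}$$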



In this paper we are concerned with a steady state
of the epidemic spreading. At the steady state we will
have a limit

$$\Phi = \lim_{t\to\infty} \phi(t),$$

together with the condition

$$\lim_{t\to\infty} \frac{d\phi(t)}{dt} = 0.$$

Substituting these into the above equations yields the
equation for $\Phi$ as follows:

$$\Phi = 1 - \frac{1}{\langle \ell \rangle} \sum_{k,\ell} \ell\, p(k,\ell)e^{-\lambda k \Phi}. \tag{7}$$



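An epidemic outbreak implies that this equation
has a solution $\Phi > 0$ other than $\Phi = 0$. Since the right
hand side of (7) is a concave function of $\Phi$ and its value
at $\Phi = 1$ is less than 1, the condition for it is

$$\frac{d}{d\Phi}\left.\left(1 - \frac{1}{\langle \ell \rangle} \sum_{k,\ell} \ell\, p(k,\ell)e^{-\lambda k \Phi}\right)\right|_{\Phi=0} > 1.$$

Hence the critical infection rate $\lambda_c$ or the threshold for
an epidemic outbreak is obtained by setting

$$\frac{\lambda}{\langle \ell \rangle} \sum_{k,\ell} k\ell\, p(k,\ell) = \frac{\lambda \langle k\ell \rangle}{\langle \ell \rangle} = 1.$$

Solving this for $\lambda$ we get

$$\lambda_c = \frac{\langle \ell \rangle}{\langle k\ell \rangle}. \tag{8}$$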
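The threshold (8) and the nonzero solution of (7) can both be evaluated numerically for a given joint distribution. The sketch below uses plain fixed-point iteration started from 1, which is one convenient choice rather than a prescribed method; the dict layout and function names are assumptions. Since the right hand side of (7) is increasing and concave, the iterates decrease towards the largest solution.

```python
import math


def threshold(p):
    """Critical infection rate (8): <l> / <kl> for a joint pmf p[(k, l)]."""
    mean_l = sum(l * w for (_, l), w in p.items())
    mean_kl = sum(k * l * w for (k, l), w in p.items())
    return mean_l / mean_kl


def solve_phi(p, lam, iters=5000):
    """Largest solution of (7), found by iterating its right hand side."""
    mean_l = sum(l * w for (_, l), w in p.items())
    phi = 1.0  # start at 1, where the right hand side is below 1
    for _ in range(iters):
        phi = 1.0 - sum(l * w * math.exp(-lam * k * phi)
                        for (k, l), w in p.items()) / mean_l
    return phi


if __name__ == "__main__":
    # Hypothetical toy joint distribution with <k> = <l>.
    p = {(1, 1): 0.5, (1, 2): 0.2, (2, 1): 0.2, (3, 3): 0.1}
    print("lambda_c:", threshold(p))
    for lam in (0.3, 0.9):
        print(lam, solve_phi(p, lam))
```

For infection rates below the threshold the iteration collapses to the trivial solution, and above it the iteration settles on a positive value.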



In [7, Chap. 10] the total number of infected individuals
is discussed for the classical SIR model, which
represents the final outbreak size. In our setting it is
the average fraction of nodes ever infected until the
disease dies out. This is written as

$$\begin{aligned}
\sum_{k,\ell} p(k,\ell)\bigl(1 - S_{k,\ell}(\infty)\bigr)
&= 1 - \sum_{k,\ell} p(k,\ell)S_{k,\ell}(\infty) \\
&= 1 - \sum_{k,\ell} p(k,\ell)e^{-\lambda k \Phi}
 = 1 - \sum_{k} P(k)e^{-\lambda k \Phi}
\end{aligned} \tag{9}$$

by means of the indegree distribution $P(k)$.

Suppose that the indegree distribution follows a
power law

$$P(k) \propto k^{-\gamma}, \quad k \ge m$$

with $2 < \gamma < 3$, then $P(k) = (\gamma - 1)m^{\gamma-1}k^{-\gamma}$ $(k \ge m)$,
where $m$ is the minimum indegree. Let $\Gamma(a, x)$ be the
incomplete gamma function defined by

$$\Gamma(a, x) = \int_x^\infty t^{a-1}e^{-t}\,dt.$$

Then the fraction (9) of the outbreak size can be written as

$$1 - \int_m^\infty P(k)e^{-\lambda k \Phi}\,dk
 = 1 - (\gamma - 1)(\lambda m \Phi)^{\gamma-1}\,\Gamma(1 - \gamma, \lambda m \Phi),$$

by the continuous approximation.
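The closed form can be checked by a change of variables. The steps below are a worked sketch under the continuous approximation already stated, using only the normalized power law and the definition of the incomplete gamma function given above.

```latex
\begin{aligned}
\int_m^\infty P(k)\,e^{-\lambda k \Phi}\,dk
&= (\gamma-1)\,m^{\gamma-1} \int_m^\infty k^{-\gamma} e^{-\lambda \Phi k}\,dk \\
&= (\gamma-1)\,m^{\gamma-1} (\lambda\Phi)^{\gamma-1}
   \int_{\lambda m \Phi}^\infty t^{-\gamma} e^{-t}\,dt
   \qquad (t = \lambda \Phi k) \\
&= (\gamma-1)\,(\lambda m \Phi)^{\gamma-1}\,\Gamma(1-\gamma,\ \lambda m \Phi).
\end{aligned}
```

The last step uses the definition of the incomplete gamma function with first argument $1 - \gamma$, so that the exponent of $t$ in the integrand is $-\gamma$.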



3. The SIS model in a directed network

In the SIS model R-nodes are absent. Those nodes
that recovered from a disease may be infected again
and again. The densities of S-, I-nodes with indegree
$k$ and outdegree $\ell$ at time $t$ are $S_{k,\ell}(t)$, $\rho_{k,\ell}(t)$ as
before, and the equality $S_{k,\ell}(t) = 1 - \rho_{k,\ell}(t)$ holds. So
equations (1)-(3) are replaced by the single differential
equation

$$\frac{d\rho_{k,\ell}(t)}{dt} = \lambda k \bigl(1 - \rho_{k,\ell}(t)\bigr)\theta(t) - \rho_{k,\ell}(t), \tag{10}$$

where $\theta(t)$ is the same probability as (4).

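At the steady state, as in Section 2, we will have
the condition

$$\lim_{t\to\infty} \frac{d\rho_{k,\ell}(t)}{dt} = 0$$

for all $k$ and $\ell$ and a limit

$$\Theta = \lim_{t\to\infty} \theta(t).$$

So we get from (10)

$$\lim_{t\to\infty} \rho_{k,\ell}(t) = \frac{\lambda k \Theta}{1 + \lambda k \Theta}.$$

Substituting these into (4) we have the equation for
$\Theta$ as follows:

$$\Theta = \frac{1}{\langle \ell \rangle} \sum_{k,\ell} \ell\, p(k,\ell) \frac{\lambda k \Theta}{1 + \lambda k \Theta}.$$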



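If this has a solution $\Theta > 0$ other than $\Theta = 0$, then it
corresponds to an endemic state. Since the right hand
side of the equation is a concave function of $\Theta$ and
its value at $\Theta = 1$ is less than 1, the condition for an
outbreak is

$$\frac{d}{d\Theta}\left.\left(\frac{1}{\langle \ell \rangle} \sum_{k,\ell} \ell\, p(k,\ell) \frac{\lambda k \Theta}{1 + \lambda k \Theta}\right)\right|_{\Theta=0} > 1.$$

Again, this yields the same threshold (8) as in the SIR
model:

$$\lambda_c = \frac{\langle \ell \rangle}{\langle k\ell \rangle}.$$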
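The endemic steady state of the SIS model can be found numerically in the same spirit as for the SIR model. The sketch below iterates the self-consistency equation for the steady-state link probability starting from 1, and then evaluates the steady-state infected density averaged over nodes. The dict layout, the function names and the node-averaged prevalence helper are assumptions added for illustration.

```python
def solve_theta(p, lam, iters=5000):
    """Largest solution of the SIS self-consistency equation for Theta."""
    mean_l = sum(l * w for (_, l), w in p.items())
    theta = 1.0  # start at 1, where the right hand side is below 1
    for _ in range(iters):
        theta = sum(l * w * lam * k * theta / (1.0 + lam * k * theta)
                    for (k, l), w in p.items()) / mean_l
    return theta


def prevalence(p, lam):
    """Node-averaged steady-state infected density (illustrative helper)."""
    theta = solve_theta(p, lam)
    return sum(w * lam * k * theta / (1.0 + lam * k * theta)
               for (k, _), w in p.items())


if __name__ == "__main__":
    # Hypothetical toy joint distribution; its threshold (8) is 1.4 / 2.2.
    p = {(1, 1): 0.5, (1, 2): 0.2, (2, 1): 0.2, (3, 3): 0.1}
    for lam in (0.3, 0.9):
        print(lam, solve_theta(p, lam), prevalence(p, lam))
```

Below the threshold the iteration decays geometrically to zero, which corresponds to the disease-free state.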

In the next section we calculate the threshold $\lambda_c$
in several cases and show that it approaches zero for
scale-free directed networks under some additional
assumptions.

4. Correlations between outdegrees and indegrees

Some real complex networks contain a few hubs,
that is, nodes with many outgoing and incoming links,
together with a vast number of nodes with very few links.
In order to discuss such a phenomenon, it is effective to
introduce the conditional probability $p(\ell|k)$ for the
correlation between outdegrees and indegrees. It indicates
the probability that a given node with indegree $k$ has
outdegree $\ell$.

First we deal with the following two extreme cases
(I) and (II) by means of $p(\ell|k)$. (I) has the highest
correlation, while (II) is the lowest one or independent
case.

Case (I) $p(\ell|k) = \delta_{\ell,k}$ for all $k$ and $\ell$.

Here $\delta_{\ell,k}$ is the Kronecker delta. This condition
implies that each node has the same in- and outdegrees.
If we regard $k$ and $\ell$ as random variables,
then $k = \ell$. In this case we have $\langle k^2 \rangle = \langle \ell^2 \rangle$ and the
denominator in (8) is

$$\langle k\ell \rangle = \sum_{k,\ell} k\ell\, p(k,\ell) = \sum_{k,\ell} k\ell\, \delta_{\ell,k} P(k) = \langle k^2 \rangle.$$



Therefore, if the indegree distribution follows a power
law $P(k) \propto k^{-\gamma}$ with $2 < \gamma < 3$, then $\langle k^2 \rangle$ diverges
and the threshold $\lambda_c$ in (8) is equal to zero as in
[1, 2, 6, 9], where this prominent result for the SIR and SIS
models was first obtained for undirected scale-free networks.

Case (II) $p(\ell|k) = Q(\ell)$ for all $k$ and $\ell$.

In this case the random variables $k$ and $\ell$ are
independent or uncorrelated and $\langle k\ell \rangle = \langle k \rangle \langle \ell \rangle$ holds, from
which we see that the threshold (8) is

$$\lambda_c = \frac{1}{\langle k \rangle} = \frac{1}{\langle \ell \rangle}.$$
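The contrast between the two extreme cases shows up clearly for a power law truncated at a maximum degree M. The sketch below assumes, purely for illustration, that indegrees and outdegrees share the same discrete power law on m..M, so that the two averages agree; the function names and parameter values are hypothetical.

```python
def power_law_pmf(gamma, m, M):
    """Discrete power law P(k) proportional to k^(-gamma) on k = m..M."""
    weights = {k: k ** (-gamma) for k in range(m, M + 1)}
    z = sum(weights.values())
    return {k: w / z for k, w in weights.items()}


def extreme_thresholds(gamma, m, M):
    """Thresholds for Case (I), <k>/<k^2>, and Case (II), 1/<k>."""
    P = power_law_pmf(gamma, m, M)
    k1 = sum(k * w for k, w in P.items())
    k2 = sum(k * k * w for k, w in P.items())
    return k1 / k2, 1.0 / k1


if __name__ == "__main__":
    for M in (10 ** 2, 10 ** 3, 10 ** 4):
        case1, case2 = extreme_thresholds(2.5, 1, M)
        print(M, case1, case2)
```

As M grows, the Case (I) value keeps shrinking because the second moment grows without bound for exponents between 2 and 3, while the Case (II) value settles near a positive constant.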

This expression of the threshold also appears for a
homogeneous SIS model in undirected networks as in [2].

Under the conditions $\langle k^2 \rangle < \infty$ and $\langle \ell^2 \rangle < \infty$ the
average $\langle k\ell \rangle$ satisfies

$$\langle k\ell \rangle \le \sqrt{\langle k^2 \rangle \langle \ell^2 \rangle}$$

by the Schwarz inequality [3]. Moreover, it also says
that the equality $\langle k\ell \rangle = \sqrt{\langle k^2 \rangle \langle \ell^2 \rangle}$ holds only if both
random variables $k$ and $\ell$ satisfy $k = a\ell$ with some
constant $a$. From $\langle k \rangle = \langle \ell \rangle$ we see that $a = 1$, which
coincides with (I). Thus, as the correlation
between indegrees $k$ and outdegrees $\ell$ becomes high,
the threshold $\lambda_c$ approaches zero:

$$\lambda_c \approx 0,$$

provided the power laws

$$P(k) \propto k^{-\gamma} \quad\text{and}\quad Q(\ell) \propto \ell^{-\gamma'} \quad\text{for } k, \ell \le M, \tag{11}$$

hold with exponents $2 < \gamma, \gamma' < 3$ and the maximum
degree $M$ is very large.

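In order to discuss more quantitatively, we might
use the correlation coefficient ([3]):

$$r = \frac{\langle (k - \langle k \rangle)(\ell - \langle \ell \rangle) \rangle}{\sigma_k \sigma_\ell}
 = \frac{\langle k\ell \rangle - \langle k \rangle \langle \ell \rangle}{\sqrt{\langle k^2 \rangle - \langle k \rangle^2}\,\sqrt{\langle \ell^2 \rangle - \langle \ell \rangle^2}}.$$

Here $\sigma_k$ and $\sigma_\ell$ are the respective standard deviations.
It satisfies $-1 \le r \le 1$, which is a variation of the
Schwarz inequality. By a simple calculation it follows
that the threshold of (8) can be written as

$$\lambda_c = \left(\langle k \rangle + r \langle k \rangle \sqrt{\bigl(\langle k^2 \rangle / \langle k \rangle^2 - 1\bigr)\bigl(\langle \ell^2 \rangle / \langle \ell \rangle^2 - 1\bigr)}\right)^{-1}.$$

So, if it is possible to find $r > 0$ by sampling, then we
have $\lambda_c \approx 0$ under the above condition (11), because
$\langle k^2 \rangle \gg \langle k \rangle^2$ and $\langle \ell^2 \rangle \gg \langle \ell \rangle^2$ in case of $2 < \gamma, \gamma' < 3$.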
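The rewritten threshold can be checked against (8) on a small example. The sketch below computes the correlation coefficient from a joint distribution and then evaluates the threshold from the moments and r alone. The dict layout and function names are assumptions, and the formula relies on the two averages being equal, as noted in Section 2.

```python
import math


def moments(p):
    """Return <k>, <l>, <k^2>, <l^2>, <kl> for a joint pmf p[(k, l)]."""
    k1 = sum(k * w for (k, _), w in p.items())
    l1 = sum(l * w for (_, l), w in p.items())
    k2 = sum(k * k * w for (k, _), w in p.items())
    l2 = sum(l * l * w for (_, l), w in p.items())
    kl = sum(k * l * w for (k, l), w in p.items())
    return k1, l1, k2, l2, kl


def correlation(p):
    """Correlation coefficient r between indegree and outdegree."""
    k1, l1, k2, l2, kl = moments(p)
    return (kl - k1 * l1) / math.sqrt((k2 - k1 ** 2) * (l2 - l1 ** 2))


def threshold_from_r(k1, k2, l2, r):
    """Threshold written in terms of r, assuming <l> = <k> = k1."""
    spread = math.sqrt((k2 / k1 ** 2 - 1.0) * (l2 / k1 ** 2 - 1.0))
    return 1.0 / (k1 + r * k1 * spread)


if __name__ == "__main__":
    # Hypothetical toy joint distribution with <k> = <l>.
    p = {(1, 1): 0.5, (1, 2): 0.2, (2, 1): 0.2, (3, 3): 0.1}
    k1, l1, k2, l2, kl = moments(p)
    r = correlation(p)
    print(r, threshold_from_r(k1, k2, l2, r), l1 / kl)
```

Setting r to zero recovers the independent case, and a positive r lowers the threshold by an amount that grows with the spread of the two degree distributions.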